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ABSTRACT 

We have used a stepwise increase in ligand 
complexity approach to estimate the relative contri- 
butions of the nucleotide units of DNA containing 
7,8-dihydro-8-oxoguanine (oxoG) to its total affinity 
for human 8-oxoguanine DNA glycosylase (OGG1) 
and construct thermodynamic models of the 
enzyme interaction with cognate and non-cognate 
DNA. Non-specific OGG1 interactions with 10-13 nt 
pairs within its DNA-binding cleft provides approxi- 
mately 5 orders of magnitude of its affinity for DNA 
(AG° approximately -6.7 kcal/mol). The relative con- 
tribution of the oxoG unit of DNA (AG approximately 
-3.3 kcal/mol) together with other specific inter- 
actions (AG approximately -0.7 kcal/mol) provide 
approximately 3 orders of magnitude of the 
affinity. Formation of the Michaelis complex of 
OGG1 with the cognate DNA cannot account for 
the major part of the enzyme specificity, which lies 
in the k cat term instead; the rate increases by 6-7 
orders of magnitude for cognate DNA as 
compared with non-cognate one. The k cat values 
for substrates of different sequences correlate 
with the DNA twist, while the K M values correlate 
with AG of the DNA fragments surrounding the 
lesion (position from -6 to +6). The functions for 
predicting the K M and k cat values for different 
sequences containing oxoG were found. 

INTRODUCTION 

A great wealth of structural data on the proteins that 
interact with DNA or RNA is currently available. It is 



now clear that both the structures of free proteins and 
nucleic acids and their conformational changes upon 
binding play important roles in the recognition of 
specific DNA and RNA sequences by proteins [(1^1) 
and references therein). However, X-ray structural 
analysis of protein-nucleic acid interactions does not 
provide quantitative estimates of the relative contribution 
of individual contacts or different types of contact to the 
total affinity of an enzyme for its nucleic acid target. 
To carry out a detailed quantitative analysis of energy 
of protein-nucleic acid interactions we have developed 
an approach of stepwise increase in ligand complexity 
(SILC), which provides information on the relative con- 
tributions of each nucleotide and its structural elements 
to the total enzyme affinity for DNA ligands [reviewed 
in refs (3,5)]. These ligands can be either cognate (i.e. con- 
taining specific features recognized by the particular 
protein, such as a recognition sequence of a restriction 
endonuclease, or a DNA lesion for DNA repair 
proteins) or non-cognate (general DNA containing no 
such specific features). Using the SILC approach, we 
have studied a number of DNA-dependent enzymes 
including those not specific for DNA structure or 
sequence such as Escherichia coli RecA (6), specific for 
DNA structure but not for sequence such as DNA poly- 
merases of prokaryotes, eukaryotes, viruses, and archaea 
(3,7,8) and human DNA ligase I (3), specific for DNA 
damage such as human uracil DNA glycosylase (UNG) 
(9,10), E.coli 8-oxoguanine DNA glycosylase (Fpg) 
(11,12), and human apurinic/apyrimidinic (AP) endo- 
nuclease (APEX1) (13), and specific for DNA sequence 
such as EcoRl restriction endonuclease (14), human topo- 
isomerase I (15,16), and HIV integrase (17). We have 
shown that all these enzymes recognize DNA by 
forming multiple additive contacts with all DNA units 
covered by the protein globule (7-20 depending on the 
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enzyme), and that the total interaction is a combination of 
weak electrostatic and hydrophobic and/or van der Waals 
interactions of the enzyme with the individual structural 
elements of DNA, which can be described by the following 
equation: 

K d [d(pN)J = K d [(Pi)] x e-xh A "xh c ' xh^xh^ 

where K d [(P,)] is the dissociation constant (K d ) for the 
minimal or?/?ophosphate ligand, e is the electrostatic 
factor reflecting the increase in enzyme affinity due to 
the interaction with one internucleoside phosphate 
group, h are hydrophobic factors for A, C, G and T nu- 
cleotide bases, the numbers of which in d(pN)„ are a, c, g 
and t, respectively. This equation describes the interaction 
of non-cognate single-stranded (ss) and double-stranded 
(ds) DNA with any enzyme. Only the values of e and h 
factors and ^ d for or?/io-phosphate change from one 
enzyme to another [reviewed in ref. (3,5)]. 

The high affinity of the enzymes for DNA is mainly (5-8 
orders of magnitude) provided by multiple weak, additive 
interactions they form with various structural elements of 
the nucleotides under the footprint (3,5). The relative con- 
tribution of specific interactions to the total affinity of the 
enzymes for DNA is rather small, not exceeding 1-2 
orders of magnitude. The specificity of the enzyme is 
provided by the enzyme-dependent stages of adjustment 
of DNA conformation to one optimal for catalysis and by 
the chemical step of catalysis: k cat is higher by 4-8 orders 
of magnitude for cognate DNA than for non-cognate 
DNA. 

Oxidized DNA bases are an important source of muta- 
tions, playing a crucial role in human mutagenesis, car- 
cinogenesis and aging (18). Mutagenic 7,8-dihydro- 
8-oxoguanine (oxoG) is easily generated in human cells. 
In human cells, oxoG is removed from DNA by 
oxoguanine-DNA glycosylase (OGG1). The enzyme has 
two catalytic activities, acting as a DNA glycosylase 
excising oxoG and as a lyase eliminating the 3'-phosphate 
at the resulting abasic site (19). The substrate specificity 
of OGG1 has been described (19,20), and the enzyme's 
mechanism has been analyzed using the stopped-flow 
approach (21,22). The kinetic scheme for OGG1 process- 
ing an oxoG:C-containing substrate was found to include 
at least three fast equilibrium steps followed by two 
slow, irreversible steps and then another equilibrium 
step. OGG1 is not homologous to Fpg, the bacterial 
oxoguanine-DNA glycosylase, and belongs to the endo- 
nuclease III superfamily (23,24). X-ray structure of OGG1 
and its complexes with DNA also lacks any similarity with 
Fpg (25-28). Thus, it is of interest to study the thermo- 
dynamic aspects of oxoG recognition by OGG1 and to 
compare it with the data available for Fpg. 

To dissect the mechanisms of substrate specificity of 
OGG1, we used the SILC approach to probe the inter- 
action of the enzyme with a series of ss and ds non-cognate 
and cognate oligodeoxyribonucleotides (ODNs), and 
analyzed the results using a thermodynamic model of 
non-specific and specific DNA recognition. Since the 
rate of the reactions catalyzed by OGG1 can depend on 
the DNA sequence, we have also addressed a possible 



effect of different conformational and physicochemical 
properties of cognate DNAs on the K M and k cat param- 
eters of the reaction. 



MATERIALS AND METHODS 

Enzymes and oligonucleotides 

Isolation of the recombinant OGG1 from the E.coli 
BL21(DE3) overproducing strain was described previous- 
ly (21). The concentration of the active form of the enzyme 
was determined in burst-phase kinetic experiments as 
in (29). Sequences of the ODNs used in this work are 
given in Tables 1 and 2. The ODNs were prepared 
from commercially available phosphoramidites (Glen 
Research, Sterling, VA, USA) on an ASM-700 synthesizer 
(BIOSSET, Novosibirsk, Russia). All ODNs were homo- 
geneous as judged by ion-exchange and reversed-phase 
chromatography and electrophoresis in 20% polyacryl- 
amide gel (PAGE) containing 89 mM Tris-borate 
(pH 8.3), 2mM EDTA and 8 M urea. ODNs used as sub- 
strates were 5'-labeled using y[ 32 P]ATP (Radioisotope, 
Moscow, Russia) and phage T4 polynucleotide kinase 
(New England Biolabs, Beverly, MA, USA) according to 
the manufacturer's protocol and annealed to the appro- 
priate complementary strand. 

Enzyme kinetics 

The reaction mixtures (10-20 ul) contained 50mM Tris- 
HC1 (pH7.5), 100 mM NaCl, 1 mM EDTA, and 1 mM 
dithiothreitol; the concentrations of ODN substrates 
were varied. In the inhibition experiments, the standard 
substrate was used in a fixed concentration (22 nM, 
2 x K M , see below), and ODN inhibitors were used at 
various concentrations depending on their affinity for 
the enzyme. All measurements were performed in the 
linear regions of the time courses and enzyme concentra- 
tion dependences of the reaction rate. The reactions were 
initiated by adding OGG1 (0.5 nM), incubated at 37°C, 
and quenched after 2-10 min by addition of NaOH (final 
concentration, 100 mM). These conditions correspond to 
the linear part of the rate dependences on the enzyme 
concentration, and the time of incubation was in the 
linear range of the time courses. The mixtures were 
heated for 2 min at 95°C, supplemented with an equimolar 
amount of HC1 and 0.5 volumes of loading dye (0.25% 
xylene cyanol, 0.25% bromphenol blue, 20 mM EDTA 
in formamide). Reaction products were separated by 
PAGE and quantified by phosphorimaging (Molecular 
Imager FX system, Bio-Rad Laboratories, Hercules, 
CA, USA). 

The K M and V max (k cat ) values were calculated by 
least-squares nonlinear regression fitting using SigmaPlot 
v8.0 software (SPSS, Chicago, IL, USA). The type of in- 
hibition and the inhibition constants (Ki) for a subset 
of ODN inhibitors were obtained using different concen- 
trations of the ODNs from the Lineweaver-Burk plot 
(30). Iso values were determined from a range of inhibitor 
concentrations (0.1-10 x I 50 ) at the concentration of the 
standard substrate equal to 2 x K M . Errors in the I 50 
values were within 20%. K { values were estimated from 
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Table 1. Affinity of OGG1 for minimal ligands, single- and double-stranded homo-ODNs and mixed-sequence ODNs 



Ligand 11 


K„ M b 




Ligand 


K h M 


Ligand 


K h M 




Ligand 


K,, M 




Ligand 


K h M 




P, 


68.0 x 10" 




dGMP 


6.5 x 10~ 3 


oxo-dGMP 


1.6 x 10" 




T 6 :A 6 


3.9 x 10" 




Ti4:A 14 


1.0 x 10" 




dAMP 


1.5 x 10 




dTMP 


2.0 x 10~ 3 


dCMP 


9.4 x 10" 




T 8 :A 8 


5.7 x 10" 




Ti6:A 16 


1.0 x 10" 




A 2 


4.4 x 10" 




T 2 


4.8 x 10~ 3 


c 2 


9.0 x 10 




Tio^Aio 


6.8 x 10 




T2o:A 2 o 


9.0 x 10" 










T 4 


7.3 x 10~ 4 


c 4 


1.5 x 10" 


-3 




3.7 x 10" 


-5 




1.0 x 10" 


-5 


A 6 


3.7 x 10" 


-4 


T„ 


3.7 x 10~ 4 


Q, 


6.0 x 10" 


-4 


OGll-compll 


K M = 1. 


1 x 10- 8 


G 1 1 :comp 1 1 


8.7 x 10" 


-6 


As 


9.0 x 10" 


-5 


T 8 


1.1 x 10~ 4 


c 8 


1.3 x 10" 


-4 














Ag 


7.0 x 10" 


-5 


I'm, 


7.7 x 1<T 5 


C id 


6.7 x 10" 


-5 














Aio 


9.3 x 10" 


-5 


Tu 


5.3 x 10~ 5 


C, 2 


3.3 x 10" 


-5 




















T, 2 


3.0 x 10~ 5 


C14 


3.7 x 10" 


-5 














A M 


3.7 x 10" 


-5 


T l4 


5.7 x 10~ 5 


Gll c 


2.0 x 10 


-5 














A 16 


2.0 x 10" 


-5 


Tk, 


4.0 x 10~ 5 


OGll c 


8.0 x 10 


X 














A20 


4.3 x 10" 


-5 


1 2(1 


2.3 x 10~ 5 




















A23 


2.0 x 10" 


-5 


1 2 ;, 


1.7 x 10~ 5 





















! 'The Kj values estimated from the Lineweaver-Burk plot are given in bold, all other K, were calculated from / 50 values. 
b Mean of three independent experiments; the error did not exceed 15%. 

The sequence of Gil and OG11 is CTCTCCCTTCXCTCCTTTCCTCT, where X = G or oxoG, respectively. The sequence of compll is 
AGAGGAAAGGAGCGAAGGGAGAG. 



the equation for competitive inhibition, v = V max x [S]/ 
[[S] + A: M x (1 + [I]/Xi)]; it is easy to see that / 50 = 3 x K { 
at [S] = 2 x Kyi. 

AG° values for contribution of various DNA elements 
into binding energy and AAG° values for differences 
between the energy contributions in different substrates 
were calculated from the equation AG° =-RTx\n K d 
(30) for T= 310K. 

Correlation analysis of the kinetic data 

To reveal possible correlations between ODN sequences 
and kinetic parameters, we have used ACTIVITY 
software specifically designed for analysis and prediction 
of conformational and physicochemical properties of 
various DNA sequences (31). We have customized this 
program, which was originally used for fully dsDNA, to 
predict the properties of DNA containing mismatches. To 
do this, it was necessary first to analyze independently 
each of two strands of fully dsDNA. The analysis was 
restricted to the string {e_i 0 - ■ -e 0 . . .e+\ 0 } (written simply 
as {e_io- ■ -e+io} for fully complementary ODNs) corres- 
ponding to a 21-mer ds fragment of the ODNs (Table 2, 
see below), where e 0 is G instead of oxoG in the modified 
strand, or <? 0 is C opposite oxoG in the complementary 
strand (henceforth, the upper index 0 will be used 
to describe the modified strand, and the upper index 
'#' will be used for the complementary strand). Due 
to the presence of mismatches in some ds ODNs, 
{e°_io- ■ -G. . .e°+io/e # -io- ■ C. . .e # +i 0 }, the predictions for 
a ds ODN based on individually treated strains can be 
different. Therefore, to match these interim predictions 
we have used a heuristic approximation of the 'limiting 
stage' using the equations: 

k ca Ae°_ l0 ...G...e° +l0 /e # _ l0 ...C...e # +m } 
= MIN(fc cat { <? ° 10 . . . G . . . e° +l0 }; k, Al {et l0 . . . C . . . e* 10 }), 

(1) 



k M {e\...G...e° +l0 /et w ...C...e# l0 } 
= MIN(/c M {e° 10 . . . G . . . e° +m }; k M {e\ . . . C . . . ef 10 }), 

(2) 

To maximize the diversity of the analyzed sequences, we 
have used a learning set encompassing ds ODN1-ODN6, 
the strands of which contain all variants of CT and AG 
permutations and A„, T„, and (C/G)„ runs encountered 
in the complete set of substrates (Table 2). At the 
final stage of the analysis, the partially mismatched 
ds ODN13-ODN15 were used as typical examples of 
ODNs containing mismatches. Fully complementary 
ODN7-ODN12 and mismatched ODN16-ODN21 were 
used as an independent control set to test the predictions 
made from the learning set. 

Finally, we have searched for possible correlations 
between the experimentally determined K M or k cat values 
with P g; [a;b] indexes of average conformational and 
physicochemical properties (l<g<38) of a DNA helix 
from the ACTIVITY database (freely available 
at: http://srs6.bionet.nsc.ru/srs6bin/cgi-bin/wgetz?-page+ 
LibInfo+-lib+PROPERTY, the PROPERTY table) (32) 
between positions a and b of the ds ODNs (— 10 < 
a<6<+10): 

Pq;[a;b]{e°_ 10 • • . 6+lo} = HP a<i<h- 1 ^/{eie i+ i /(b - Cl). 

(3) 

Table 3 shows two examples of conformational and 
physicochemical properties of a B-DNA helix in the 
ACTIVITY database, which includes a total of 38 differ- 
ent properties such as size (step, width, groove depth, etc.), 
angles (twist, opening, tilt, etc.); melting point, flexibility, 
entropy, enthalpy, Gibbs free energy and other (32). 
Altogether there are 38 x 21 x 20/2 = 7980 variants of 
P g; [a;b] f° r ds DNA containing 21 bp. 

We have correlated every P q -[ a \b\ with -^m an d k cat using 
bootstrapping (33) on seven sets using 6 of 12 chains of ds 
ODN1-ODN6 (Table 2): (i) minimal P q - [aM \ (h) maximal 
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Table 2. Kinetic parameters of 8-oxoG excision from DNA substrates of different sequence 



ds ODN ID 


ss ODNs ID 


Structure and numbering of ds ODNs" 

+10 0 -12 


Number of 
mis-matches 


K M (nM) b 


k CM (min ') 


ODNl c 


OG1 1 
compl 1 


3 ' - GAGAGGGAAGCGAGGAAAGGAGA - 5 ' 


o 


1 1 ± 3 


1.0 ± 0.1 


ODN2 


2 
2c 


c;' - rTr i TrrrT r rrYr r rr , PTr r rr i r i Tr"r- V 

J L1L1LLL11 LAL -LLL1L1LL1' L — J 

3 ' - GAGAGGGAAGCGAGGAGAGGAGA - 5 ' 


o 


23 ± 4 


1.4 ± 0.1 


ODN3 


j 

3c 


3 ' - GAGAGGGGAGCGAGGAGAGGAGA - 5 ' 


n 

u 


21 =1= 5 


1 1 ± 0 1 


ODN4 


4 
4c 


c;'_prrr ,r rpppp r T i PYP | T | pp r r r r r rpp r rPT i -l' 

D L1L1LLLI L LAL 1LL1 1 1LL1 1 L O 

3 ' - GAGAGGGGAGCGAGGAAAGGAGA - 5 ' 


n 

u 


?0 ± 4 


0 0 + 01 
\j.y x u.i 


ODN5 


5c 


[; / _r i Tir i frpprp r r r PPYP r PPPTTTPP r rP'T 1 -^' 
J L1L1LL111 LAL J. LL 1 1 1 LL 1 ( L O 

3 ' - GAGAGGAAAGCGAGGAAAGGAGA - 5 ' 


o 


16 ± 4 


1.1 ± 0.1 


ODN6 


6c 


r'-aaaaaaaa TxcvccnnnrTTTTTT' - V 

J rirlririnnnnnLALLLLLLLLLuLu J 

3 / _ TTTTTTTTTGCGCGGGCGGGCGC - 5' 


o 


26 ± 5 


1.2 ± 0.1 


ODN7 


7 
7c 


C ' mmmmmmrnrnrp,^ mmmmmmmmm rr>fT> O' 

3 -111111111 LAL 11111111UI~j 

3 ' - AAAAAAAAAGCGAAAAAAAAAAA - 5 ' 


n 
u 


1 1 =1= 3 


1 3 it 0 1 


ODN8 


8c 


5 — ^^^'^'PTTTTTXTTTTTTTTTT TT~ 3 
3 ' - AAAAAAAAAAC AAAAAAAAAAAA - 5 ' 


o 


29 ± 6 


0 96 + 0 07 


ODN9 


9 

9c 


c.'-PPfr'Pr'ppr'PYP AAAAAAAAAAA-V 
J LLLLLLLLLLALnrinAnnririrlrin J 

3 ' - GGCGGGCGCGCGTTTTTTTTTTT - 5' 


o 


64 ± 8 


2.9 ± 0.2 




i n 

1 u 
10c 


D 1111111 LLLAbuL lllllllll j 

3' - AAAAAAACCCCCCCAAAAAAAAA- 5' 


n 

u 


66 ± 12 


1 0 ± o 1 


ODN1 1 


j [ 
lie 


q'_nAr , r , r , Ar , r , r , r , vr , r'r , PAPr , PAPr , r i — ^' 

D LALLLiiLiLLiLALLLLiiLLL/iLLLi ~> 

3'-CTCGCTCGCGCGCGCTCGCTCGC-5' 


n 

u 


78 ± 8 


2 4 + 01 


ODN 12 


1 2 
12c 


^'-PTipTPPPTTPYaTPPTTTPPTPT-T' 
J L1L1LLL1-L L Art. J.LL111LL1L1 ~> 

3 ' - GAGAGGGAAGCTAGGAAAGGAGA- 5' 


o 


41 ± u 


1.2 ± 0.1 


ODNH 


OG1 1 

12c 


R'-PTPTPPPTTPYPTPPTTTPPTPT-^' 
J L1L1LLL11 LAL 1LL1 1 ±LL1L1 J 

3 ' - GAGAGGGAAGCTAGGAAAGGAGA - 5 ' 


1 


22 ± 2 


0 94 + 0 1 0 




7 

8c 


D "lllllllll LAL 11111111111 J 

3 '- AAAAAAAAAAC A AAAAAAAAAAA - 5 ' 


2 


390 ± 20 


1 3 ± 0 4 


ODN 15 


g 

10c 


5 — ^^'^'T^'P'PTT'X'Ta'TTTTTTTTTT TT~ 3 
3'-AAAAAAACCCCCCCAAAAAAAAA-5' 


f. 


400 ± 50 


0 40 + 0 0? 


v y ly i n iu 


7 

10c 


c' rnmrnrnmmmmm/nvprprprTimrnmrnrpmmfTi O' 
J LAL 1111J.J.J.J.J.J.1 J 

3'-AAAAAAACCCCCCCAAAAAAAAA-5' 




310 ± 50 


0 48 + 0 07 


ODN 17 


8 
7c 


3 ' - AAAAAAAAAGC G AAAAAAAAAAa - 5 ' 


2 


70 ± 10 


1.7 ± 0.05 


ODN 18 


OGll 

7c 


5'-CTCTCCCTTCXCTCCTTTCCTCT-3' 
3'-GAGAGGGAAGCGAGGAGAGGAGA-5' 


1 


16 ±7 


0.80 ± 0.10 


ODN 19 


OGll 

3c 


5'-CTCTCCCTTCXCTCCTTTCCTCT-3' 
3'-GAGAGGGGAGCGAGGAGAGGAGA-5' 


2 


59 ± 14 


1.9 ± 0.1 


ODN20 


OGll 

4c 


5'-CTCTCCCTTCXCTCCTTTCCTCT-3' 
3 ' - GAGAGGGGAGCGAGGAAAGGAGA - 5 ' 


1 


22 ± 6 


1.4 ± 0.1 


ODN21 


OGll 

5c 


5'-CTCTCCCTTCXCTCCTTTCCTCT-3' 
3'-GAGAGGAAAGCGAGGAAAGGAGA-5' 


1 


8.0 ± 4.0 


1.4 ± 0.1 



' l X = oxoG. Mismatched nucleotides are underlined. The 21-nt stretches shown in capital letters without two nucleotides that are italicized were used 
for correlating physicochemical properties of the ODNs with K M and A; cat . 
h K M and k c . M values are given as mean ± SEM of 3-5 independent experiments. 
c ds ODN1 is identical to OGllxompll. 



P g; [a;b]l (iii) the P q; [ a ;b] value nearest to the mean for 
ds ODN1-ODN6; (iv) minimal K M or k cat values; (v) 
maximal K M or £ cat values; (vi) the K M or £ cat values 
nearest to the mean for ds ODN1-ODN6; (vii) minimal, 
maximal, and nearest to the mean values of P q -[ a -b] and K M 
or /c cat for ds ODN1-ODN6. For each of these seven sets 
of P q ;[a;b] an d or ^'cat pairs, 11 statistical criteria 
were checked: (i) linear correlation; (ii) Spearman's rank 
correlation coefficient, (iii) Kendall x rank correlation 
coefficient; (iv) Pearson's x 2 an d (v) Fisher's exact tests 
for 2x2 contingency tables; (vi) and (vii) uniformity of 
distribution of P q; [ a ;b\ an d or /c cat ; (viii) and (ix), nor- 
mality of distribution of the residuals of K M or fc cat after 
a direct regression K M = k x [P q; [ a; b] — p] (or & cat = k x 
[Pq-,[aM ~ P]) or °f Pq;[a-,b} after the inverse regression 
Pq:[a-h] = P x [K M - k] (or P q . [a . b] = px [k cat - k]); (x) and 



(xi); sign test for A = P q; [ a; b] - P x [^m - k] (or A = 
P q -,[a;b\ - P x [k cat - k]) and A' = Km — k x [P q - [c ,-b] - p] (or 
A' = &cat — K x [P q ; [aM~ Pi)- ^he analysis of each of the 
eleven criteria (m) using each of these seven sets of data 
(n) produces 1 1 x 7 = 77 levels of statistical significance 
(a fflj „) of paired conformity between the context-dependent 
calculated parameters P q -[ a -b] for the strands of ds ODN1- 
ODN6 (Table 2) and the experimentally determined K M or 
k cai values for these ODNs. The levels of significance a m> „ 
were analyzed using Zadeh's fuzzy logic (34) by their rep- 
resentation on a scale of 'fuzzy decision' from — 1 to +1 
on the basis of a heuristic rule: significant positive deci- 
sion (a„ v , < 0.05), insignificant or negative decision 
( a m,« > 0.05); detection threshold a„ M1 = 0.05. Thus, the 
decision scale q m ,n{P q ;[a;b\) was a ssigned the constant 
value of 1 in the case of oc m> „(P 9; [ a;il ])<0.01, the opposite 
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Table 3. Examples of conformational and physicochemical properties 
of all sixteen possible dinucleotide steps of a DNA helix in the 
ACTIVITY database (32) 



MI P0000011 11 




MI P0000038 




MN Conformational 




MN Physicochemical 


ML dinucleotide step 


ML Dinucleotide step 


RN Suzuki et at (52) 


RN Sugimoto 


et al. (53) 


PN Twist angle 




PN Free enerj 


;y change 


PM Average from X 


-ray structures 


PM Calculated 


PU Degree 




PU kcal/mol 




AA 35.3 


GA 40.3 


AA -1.2 


GA -1.5 


AT 31.2 


GT 32.6 


AT -0.9 


GT -1.2 


AG 31.2 


GG 33.3 


AG -1.5 


GG -2.1 


AC 32.6 


GC 37.3 


AC -1.5 


GC -2.3 


TA 40.5 


CA 39.2 


TA -0.9 


CA -1.7 


TT 35.3 


CT 31.2 


TT -1.2 


CT -1.5 


TG 39.2 


CG 36.6 


TG -1.7 


CG -2.8 


TC 40.3 


CC 33.3 


TC -1.5 


CC -2.1 



a MI, number of entry in the database; MN, type of the property 
(conformational or physicochemical); ML, elementary unit for the 
entry (a nucleotide or a dinucleotide step); RN, reference to the 
source of the data; PN, property name; PM, type of primary data; 
PU, units of measurements for the given property. Examples of a con- 
formational property (twist angle, left column) and a physicochemical 
property (AG°, right column) of dinucleotide steps are given. 
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Figure 1. The scale of 'fuzzy decision' of Zadeh fuzzy logic (34) 
given by projection of the output of the mth statistical test on the 
nth data set (the level of statistical significance ol„,„) onto an interval 
[—1,1] with the fuzzy decision threshold q,„„ = 0, assigned to the 
statistical threshold a mn = 0.05, separating the positive fuzzy decision 
Qm.n >0 in the case of a statistically significant result (a mi „ <0.05) from 
the negative fuzzy decision q,„ „ <0 in the case of an insignificant result 
(«„,,„ >0.05). 



constant value —1 in the case of <z m ,„(P q -\ a \b\) > 0-1 an d 
between them, 0.01 < a. m ,n{Pq;[a-b]) < 0.1, by the square 
approximation; 

1m,n(Pq,[a,b]) = 1.3 — 28.3 X Ot m „{P q [„£,]) + 55.6 

2 (4) 

X CCm f n\Pq,[a,b]) ■ 

The decision scale described by Equation (4) is shown in 
Figure 1. As one can see, this dependence is a parabola 
defined by three points: the common decision threshold 
(a„, „ = 0.05; q mM = 0) and two points of continuity: 
(<V» = 0.01; q„ K „ = 1) and (oc„„„ = 0.1; q nu „ = -1). Every 
evaluation of P qi [ a ;b\ properties of ds ODN1-ODN6 
(Table 2) generates 77 fuzzy decisions q m>n (f 'qfab]) on the 
significance of their correlation with K M or /c cat values for 
these ODNs. All 77 fuzzy decisions were finally averaged 
according to the theory of additive decision making (35): 

Qm?i(Pq,[a,b\) = l<m<ll 1<«<7 qm,n(Pq,[a,b])} /(^ 1 X 7). 

(5) 

Equation (5) gives the final decision Q(P q: [ a -/,]) about the 
correspondence between the given property of ds DNA 
helix and the experimentally measured values of K M or 
kcnt for this DNA. Maximal positive Q(P CJ ;[a;b]) indicates 
which property P q over the [a;b] stretch of DNA helix, of 
all 7980 variants produced by Equation (3), best correlates 
with the measured K M or k cat . 

RESULTS 

Inhibition of OGG1 by various ligands 

In order to estimate the contribution of various elements 
of DNA structure into the total affinity of OGG1 for 



non-cognate and cognate DNA, we have determined the 
affinity of this enzyme for several ligands of increasing 
complexity, including or/Aophosphate, dNMPs, ss 
homooligomeric ODNs, ss ODNs of mixed sequence, ds 
homooligomeric ODNs and ds ODNs of mixed sequence 
(Table 1 and Supplementary Table). The binding affinities 
were determined from competition experiments using the 
ligands as competitors in the OGG1 -catalyzed reaction of 
cleavage of an oxoG-containing substrate, ds ODN 
OGllxompll (Table 1 and Supplementary Table). Since 
undamaged ss or ds DNA, as well as oxo-dGMP and 
oxoG-containing ODN OG11 in an ss form, are not 
cleaved by OGG1 (19), the inhibition constant may be 
equated with the binding affinity of these ligands for 
OGG1. 

All tested short cognate and non-cognate ss or ds ODNs 
inhibited the reaction of cleavage of oxoG catalyzed by 
OGG1. Using ds OG1 1 as a substrate, several other ODN 
substrates and inhibitors were shown to be competitive 
inhibitors towards it (Figure 2, Table 1). Therefore, K\ 
gives an estimate of binding affinity (K d = Ki) of ODNs 
for the DNA-binding site of OGG1. In the absence of the 
inhibitor, the reaction was characterized by K M = 11 nM. 
Since most of the short ODNs had relatively low affinities 
for OGG1, the K t values were calculated from the IC 50 
values using the equation corresponding to competitive 
inhibition, IC 50 = 3K { at S = 2K M (Table 1). 

The minimal inhibiting ligands of OGG1 were 
orf/zophosphate K { = 68 mM) and different dNMPs, 
which demonstrated affinities (K\ = 1.5-9. 4 mM) compar- 
able with that for oxo-dGMP {K { = 1.7mM, Table 1). 
Consequently, the DNA-binding groove of OGG1 recog- 
nizes all free dNMPs. 




-10 -5 0 5 10 15 20 



1/[ds OG11], |iM" 1 

Figure 2. Analysis of the inhibition type and estimation of A', for 
d(pT)i 0 :d(pA) 10 in the reaction of oxoG excision from ds OG11 
catalyzed by OGG1, using a Lineweaver-Burk plot. The inhibitor 
was used at 0 (line 1), 0.15 (line2), 0.30 (line3) and 0.45mM (line4). 
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Figure 3. Dependencies of log A",- on the length («) of ss (solid line) and 
ds (dashed line) d(pN)„ inhibitors of oxoG excision from ds OG11 
by OGG1. Triangles, d(pT)„; circles, d(pA)„; crosses, d(pC)„; squares, 
d(pT)„:d(pA)„ 



Additive interactions of OGG1 with nucleotide units of 
ss oligonucleotides 

The underlying assumption of S1LC analysis is that the 
Gibbs free energy for complex formation can be expressed 
as a sum of the AG° values for individual contacts: 
AG° = AG° i + AG° 2 + • . . +AG°,„ with AG°,- = -RT x In 
K di , where K di characterizes the contribution of an indi- 
vidual contact (30). Hence, the overall K d value for the 
formation of an enzyme:DNA complex is the product of 
the K d values for the individual contacts: AG° = — ^7" x 
In K d = —RT x ln(A' dl x K d2 x . . . x K dn ). 

To assess whether the energy of interactions of OGG1 
with ODNs is indeed additive in this manner, the data 
(Table 1) were analyzed as log dependences of K\ 
(directly proportional to AG°) for d(pN)„ versus the 
number of mononucleotide units («; Figure 3). The 
linear log dependences for ss d(pN)„ (for 0<»<10; 
n = 0 corresponds to Pi) provide the evidence for the 
additivity of AG° values for the interaction of ten individ- 
ual d(pN)„ units with OGG1, consistent with our earlier 
observations that the protein globules of all relatively 
small DNA-dependent enzymes (30-40 kDa; the molecu- 
lar mass of OGG1 is 39 kDa) interact with 9-1 1 nt units of 
DNA [reviewed in ref. (5)]. 

Values of the / factor (1.78 ± 0.05) reflecting the 
increase in the affinity of OGG1 for various d(pN)„ per 
a unit increase in length, were evaluated from the slopes of 
the linear parts of these curves (Figure 3). The uniform 
increases in K t , corresponding to the interaction between 
the enzyme and one unit of ss DNA, are equal to the 
reciprocals of these factors (K x = l/f = 0.56 ± 0.02 M). 
An extrapolation of all dependences to n = 1 gives 
Ki = 12 mM; this value characterizes the interactions of 
the active center of OGG1 DNA-binding groove with 
one position standing out among the lOnt of any ss 
DNA bound by the enzyme. Notably, the K t values 



experimentally determined for free dNMPs (1.5-9. 4 mM, 
Table 1) are 1.3- to 8.0-fold lower. Such a situation was 
observed earlier for other enzymes (5) and is thought to 
reflect a greater conformational freedom of free nucleo- 
tides as compared with dNMPs in DNA constrained by its 
3'- and 5'-linkages, especially if the enzyme's action 
involves a specific conformational adjustment of the 
target dNMP in the natural substrate, such as flipping of 
the damaged dNMP out of the DNA helix into the active 
site of OGG1 (25). Thus, one distinguished position in the 
DNA-binding groove of OGG1, presumably the active 
site of the enzyme, is characterized by the affinity for 
non-cognate d(pN) within ss DNA (K { = 12 mM) that is 
~47-fold stronger than the interaction of OGG1 with any 
other nucleotide unit of long ss DNA in the groove 
(Ki = 0.56 M). 

The interaction of OGG1 with all units of d(pN)i_i 0 is 
additive, and the K d (K{) values for any ODN can be 
obtained by multiplying K d for the minimal ligand (P ; ) 
by = 1// = 0.56 M for each mononucleotide unit: 

Xd[d(pN)J = Kd[(PQ] x [l/f]" = K d [(dNMP)] 

i (o) 
x [1//F -1 (1 < n < 10). 

In contrast to DNA polymerases, human uracil-DNA 
glycosylase (UNG), AP endonuclease (APEX1), and topo- 
isomerase I (3,7,8,10,13,15,16) but similarly to E.coli 
Fpg and EcoRl endonuclease (11,12,14), the affinity of 
OGG1 for non-cognate d(pC)„, d(pT)„ and d(pA)„ did 
not depend on the relative hydrophobicity of their bases 
(Figure 3). The affinity of a 23-mer mixed-sequence ODN 
Gil (20 (iM) was the same as that for all homo-d(pN) 2 3 
(17-20 uM; Table 1). This means that the enzyme essen- 
tially does not contact nine of the ten DNA bases, but 
mainly interacts with the sugar-phosphate backbone of 
these units. To confirm this and to assess the contribution 
of the sugar-phosphate backbone structural units to the 
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formation of the weak additive contacts between OGG1 
and ODNs, we synthesized an abasic oligomer, (pF) 9 pT, 
where F is an analog of deoxyribose resistant to cleavage 
by OGG1 because its hydroxyl moiety at CI' has been 
replaced by a hydrogen atom. The affinity of (pF) 9 pT 
for OGG1 = 70 ± 14uM) was not different from 
that of regular d(pN) 10 (67-93 uM; Table 1). In addition, 
r(pA) 10 and r(pC) 10 demonstrated the affinity only 1.5- to 
1.9-fold lower (140 uM and 130 uM, respectively) than the 
corresponding d(pN)i 0 . 

A ~ 1.78-fold change in the affinity with d(pN)„ elong- 
ation by 1 nt unit (AG° approximately — 0.34kcal/mol) is 
lower than would be expected for strong electrostatic 
contacts (up to — l.Okcal/mol) or hydrogen bonds (from 
—2 to — 6kcal/mol), but comparable to the values 
characterizing weak ion-dipole and dipole-dipole inter- 
actions (30). According to the crystal structure, OGG1 
possesses a large positively charged groove (25-27). 
Thus, the interactions of the negatively charged inter- 
nucleotide groups of non-cognate ODNs with the 
DNA-binding groove of OGG1 may use electrostatic 
interaction of dipoles rather than point charges and 
resemble the interface between oppositely charged 
surfaces. 

Affinity of OGG1 for non-cognate and cognate 
DNA duplexes 

As the next step in our analysis of OGG1 specificity, we 
have compared the affinities of this enzyme for ds ODN of 
different composition and length. Figure 3 and Table 1 
show that the minimal ligand exhibiting duplex properties 
for OGG1 (i.e. bound by the enzyme differently from 
binding its two strands independently) is d(pT) 8 :d(pA) 8 , 
for which the value of T m (21°C) (36) is lower than the 
temperature at which the reaction was performed (37°C). 
Thus, OGG1 can stabilize short ODN duplexes. A similar 
phenomenon of duplex 'assembly' and stabilization has 
been observed for topoisomerase I and DNA polymerases 
(3,7,8,15,16). In contrast to ss ODNs, the increase in 
log(^Ti) for ODN duplexes was linear up to n = 13-14. 
The affinity of OGG1 for non-cognate homo-ODN and 
mixed-sequence ODN duplexes was ~1.5- to 3.3-fold 
higher than for the corresponding ss ODNs (Table 1). 
The affinities of d(pT)„:d(pA)„ duplexes (n > 8) could be 
formally described by the same Equation (6) as for ss 
ODN, the / factor having decreased from 1.78 ± 0.05 to 
1.60 ± 0.05. Thus, the second strand contributes to the 
affinity of OGG1 for DNA much less than the first one. 
The continued increase in the affinity for ds d(pN)„ 
between n = 10 and 13-14 may be caused by additional 
contacts of the second strand with the enzyme and com- 
plementary interactions between the strands. 

The affinity of the enzyme for mixed-sequence ss and ds 
Gil (20 ± 2 and 8.7 ± 0.9 uM, respectively) is comparable 
with that for ss and ds homo-d(pN),, (34 ± 13 and 
9.8 ± 0.5 uM, respectively, averaged over all sequences 
with n>\ \ and >14, respectively; Table 1). The substitu- 
tion of cognate ss OG11 {K\ = 0.080 uM) for non-cognate 
ss Gil (20 uM) leads to a 250-fold increase in the affinity 
for OGG1. Finally, the affinity of OGG1 for cognate ds 



ODN OG11 was estimated from K M of oxoG excision; 
since k cat of this reaction is quite low, on the order of 
lmin -1 [(19) and see below], the Michaelis complex is in 
a rapid equilibrium with the substrate and K M ~ Kd- The 
affinity of OGG1 for cognate ds OG11 (0.011 uM) was 
higher than for d(pT) 14 _ 2 3:d(pA) 14 _ 2 3 and ds Gil by a 
factor of ~790 (Table 1). Interestingly, the active site of 
OGG1 binds free oxo-dGMP ~7. 5-fold more tightly than 
one non-cognate d(pN)„ unit within DNA only 4.1 -fold 
more tightly than free dGMP (Table 1). Thus, the relative 
contribution of specific interactions of the active center of 
OGG1 with oxoG is higher for oxo-dGMP units within 
cognate DNA than for free oxo-dGMP. 

Thermodynamic models of OGG1 interaction with 
cognate and non-cognate DNA 

Based on the measurements of binding affinity of OGG1 
for various ligands, we have constructed a model 
summarizing the contribution of different interactions to 
DNA binding by OGG1. As mentioned above, extrapola- 
tion of the log(^Ti) dependencies to n = 1 gives a Ki value 
of 12mM, which characterizes the interaction of the active 
site of OGG1 with one of the nucleotides of d(pN)„ bound 
to the protein. From this, the AG° of one non-specific 
dNMP interaction with the active site of OGG1 can be 
estimated as — 2.6kcal/mol. Since 12 phosphate groups of 
one strand of ds d(pN) 13 interact with OGG1 through 
weak, additive contacts, (AG° approximately — 0.34kcal/ 
mol each, as calculated from the /factor), the summarized 
relative contribution of these internucleotide phosphates is 
A G° approximately — 4.1 ± 0.2kcal/mol. A comparable 
value of A A G° approximately — 3.9 ± 0.2kcal/mol may 
be calculated from the ratio of the K\ values for dTMP 
and d(pT) 13 _ 23 (average AG° = -4.0 ± 0.2kcal/mol for 
n = 13-23). Thus, all contacts of OGG1 with 10-14 nt 
(including the dNMP unit with the higher affinity) of 
one strand of DNA provide AG° of -6.7 ± 0.3 kcal/mol 
at most. From the ratio of the average K t values for ss 
and ds d(pN) 13 _ 23 including Gil (2.3- to 3.3-fold) 
characterizing the difference in the affinity of OGG1 for 
ds ODNs compared to ss ODNs, the contribution of the 
second strand to the affinity are only AG° approximately 
— 0.5 ... —0.7 kcal/mol. The X-ray structure of cognate 
and non-cognate OGGl:DNA complexes and 
stopped-flow data indicate that the enzyme distorts any 
DNA, creating a sharp kink, but fails to insert the 
non-damaged base into the active site pocket 
(21,22,25,27). Therefore, the active site of OGG1, as 
defined using the thermodynamic SILC approach, 
probably mostly interacts with the sugar-phosphate 
backbone of a dNMP unit within non-cognate DNA, 
which explains why the affinity of the enzyme for free 
dNMPs is greater than the affinity of its active site for 
dNMP units within long ODNs. Overall, the types of 
OGG1 interaction with non-cognate DNA can be 
summarized by the thermodynamic model shown in 
Figure 4. 

The issue how much binding energy is contributed by 
the lesion itself is less straightforward. In the bacterial Fpg 
protein, which recognizes oxoG but is not related to 
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All contacts of 0GG1 with two strands 
of non-cognate ds DNA 
provide total AG" ~ -6.7 kcal/mol 



Deoxyribose and one Each phosphate, 
phosphate contribution, AG ° ~ -°- 34 kcal/mol 
AG" ~ -2.6 kcal/mol aN phosphates, 




Figure 4. Thermodynamic model of the interaction of OGG1 with 
non-cognate DNA. AG" values characterizing various contacts between 
the enzyme and DNA containing a G base are shown. All types of 
non-specific additive interactions of the enzyme and two strands of 
non-specific DNA provide AG" = —6.7 kcal/mol of total binding energy. 

OGG1 by sequence or structure, the relative contribution 
(approximately —0.8 kcal/mol) of the specific interactions 
of with oxoG unit is nearly comparable at the level of 
minimal dNMP ligands, cognate ss d(pN)„, and ds 
ODNs (11). The same situation was observed for two 
other repair enzymes, UNG and APEX1 (9,13). This 
means that the contribution of specific and non-specific 
interactions of the damaged nucleotides of cognate 
DNA to its total affinity for these enzymes is close to 
additive. On the other hand, since the difference in the 
affinity of OGG1 for non-cognate and oxoG-containing 
ligands increases significantly from free dNMP (4.1 -fold) 
to ss (250-fold) and finally to ds DNA (790-fold, see the 
values for the corresponding pair of cognate and 
non-cognate ligands in Table 1), the ratio of the K\ 
values for dGMP and oxo-dGMP (4.1, corresponding to 
AAG° = —0.8 kcal/mol) may provide only a lower 
estimate of the true contribution of a oxoG unit to the 
affinity of OGG1 for cognate DNA. The upper limit of 
this contribution is given by the 250-fold difference 
(-3.3 kcal/mol) for ss DNA and the 790-fold difference 
(—4.0 kcal/mol) for ds DNA. However, at least some of 
this additional binding energy is most likely provided by 
strengthening of the non-specific contacts between the 
enzyme and the undamaged part of damaged DNA in 
the catalytically competent complex. As OGG1 was 
totally deficient in the removal of oxoG from ss DNA 
(data not shown), one can reason that no conformational 
adjustment takes place in OGG1 complexes with cognate 
ss DNA, and therefore the additional binding energy of 
—3.3 kcal/mol most closely reflects the contribution of an 
oxoG unit when incorporated in the DNA strand 
(Figure 5). 

In contrast to OGG1, at least some cleavage of ss cog- 
nate DNA was reported for Fpg (37) and APEX1 (13), 



All contacts of OGG1 with two strands 
of cognate ds DNA 
provide total AG 0 — 11.0 kcal/mol 

All specific contacts of AH nonspecific contacts of 

oxoG with the active 0GG1 witn phosphates of 

site of OGG1 'he damaged strand, 

AG 0 ~ -3.3 kcal/mol AG ° ~ - 40 kcal/mol 




contacts with the second strand 
plus complementary interactions 
AG" ~ -0.5. ..-0.7 kcal/mol; 
specific interactions decrease AG 0 to — 1 .2 kcal/mole 

Figure 5. Thermodynamic model of the interaction of OGG1 with 
cognate DNA with an oxoG base. Strengthening of the enzyme 
contacts with the damaged and complementary strands of the 
cognate DNA is indicated. The estimated AAG° value characterizing 
the net change in the interactions of all types between non-cognate and 
cognate DNA complexed with OGG1 is -4.3 ± 0.2 kcal/mol. The 
amino acid residues of OGG1 interacting with specific DNA are 
shown after (25). 



whereas UNG actually prefers ss over ds substrates (38). 
Taking into account the 790-fold (AAG° = — 4.0kcal/ 
mol) increase in the affinity of OGG1 for ds oxoGll as 
compared with ds Gil, it is likely that the second strand 
of the cognate ds DNA is more important for 'orbital 
steering 1 by OGG1 than by Fpg, UNG, or APEX1. 
Since the ratio of K\ values for cognate ss and ds OG11 
(~7.3-fold; AAG° = -1.2 kcal/mol) is higher than that for 
non-cognate ss and ds ODNs (2.3- to 3.3-fold), the con- 
tribution of the undamaged strand to the affinity of the 
enzyme for cognate DNA is greater than for non-cognate 
DNA. 

In principle, the 790-fold increase in the affinity may be 
caused by several factors, including strengthening of the 
contacts formed between OGG1 and ss OG11 after the 
addition of the second strand, a change in conformation 
of both strands, or even formation of additional contacts 
of the enzyme with both strands of ds DNA; these 
possibilities cannot be distinguished from the present 
data. However, all non-specific interactions contribute 
approximately AG° = —6.7 kcal/mol, the approximate 
contribution of the oxoG estimated from the increase in 
the affinity from ss Gil to ss OG11 (see above) is 
—3.3 kcal/mol, while the increase in the affinity due to 
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specific interactions of the enzyme with both strands of ds 
OG11, calculated from the difference in the affinity of 
OGG1 for ss and ds cognate and non-cognate ODNs, is 
AAG° = (-1.2kcal/mol)- (- 0.5 kcal/mol) = (-OJkcal/ 
mol). The sum of these energies, —10.7 kcal/mol, is close 
to AG° = —11.0 kcal/mol, the value characterizing binding 
of OGG1 to ds OG11. Overall, the interaction of OGG1 
with cognate ds OG1 1 may be approximately described by 
the thermodynamic model in Figure 5. 

Obviously, formation of a complex between OGG1 
and cognate DNA cannot explain the in vivo high 
enzyme specificity. At the same time, canonical bases 
are not removed from DNA by OGG1 even at very high 
enzyme concentrations and long incubation times, putting 
the estimate of the preference for the damaged base at 
>10 6 - to 10 7 -fold (considering the detection limits of a 
phosphorimaging-based assay). Therefore, £ cat for oxoG 
exceeds that for normal bases by approximately 6-7 
orders of magnitude, suggesting that, similar to other 
investigated enzymes, the adjustment and catalytic stages 
appear significantly more sensitive to the DNA structure 
than the stage of complex formation between OGG1 and 
DNA. Yet, even in the case of completely sequence- and 
structure-independent enzymes, like canonical DNases, 
the rate of the catalysis varies significantly with the struc- 
ture of cleaved sites (39). Therefore, as the next step of our 
analysis, we inquired how local conformational and 
thermodynamic parameters of cognate DNA may influ- 
ence the removal of oxoG by OGG1. 

Effect of substrate DNA structure on the 
reaction kinetics 

The footprint of OGG1 defined by its X-ray structure 
covers 10-11 bp of DNA around oxoG, with the enzyme 
sharply kinking DNA and flipping the damaged 
deoxynucleotide out of the double helix (25,27). The ex- 
tensive enzyme-DNA interactions suggest that the affinity 
of OGG1 for cognate ds DNAs of different sequences and 
the rate of oxoG removal from them may significantly 
depend on the structural features of the substrate. We 
have constructed 21 ds ODNs (ODN1-ODN21) using 12 
ss ODNs containing oxoG (strands 1-12) and 12 com- 
pletely or partially complementary ss ODNs (strands 
lc-12c, resulting in 0-6 mismatches after the annealing) 
containing a C base opposite oxoG and determined the 
K M and k cat values for them (Table 2). The maximal 
affinity (in terms of K M ), 1 1 nM, and a high k cat value 
(1.3 min -1 ) was observed for a completely complementary 
ds ODN7 (T 9 CXCT„, here and below X = oxoG), and 
variation of the sequences flanking the central CXC 
trinucleotide (ODN1-ODN6) produced only a 1.0- to 
2.6-fold increase in the K M values and even smaller 
changes in the k cat values (0.9-1. 4 min -1 ). Only for two 
ds substrates containing a CXC trinucleotide, ODN9 and 
ODN11, the changes in K M and k cat were higher, 5.8- to 
7.1-fold and 1.8- to 2.2-fold, respectively. A replacement 
of the central CXC of ODN7 with a TXT trinucleotide in 
ODN8 slightly increased the K M (2.6-fold) and decreased 
the k cat (1.3-fold) values (Table 2). A stronger decrease in 
the affinity (3.7- to 8.0-fold) was observed for ODN10 and 



ODN12 containing oxoG in the GXG and CXA context, 
but the decrease in k cat values for these ODNs was rela- 
tively small, 1.1- to 1.3-fold as compared with ODN7. The 
ds ODNs used in these experiments contained different 
AA-, TT-, CC-, AT- and GT-rich sequences, different in 
their rigidity and local conformational parameters (40- 
42). The observed moderate changes in the kinetic param- 
eters for ds ODN1-ODN7 indicate that the conformation 
of these ODN in solution exerts little influence on the 
reaction efficiency after their binding to the enzyme. 

Mismatches in DNA facilitate its structural deform- 
ation that can either assist or interfere with its adjustment 
in the enzyme's DNA-binding groove (43). A replacement 
of the complementary T:A pair in ds ODN1 in the 
position +3 with a T:G mismatch (ds ODN20, see 
Figures 4 and 5 for the numeration of the positions) 
increased the K M value twofold, while a replacement of 
the C:G pair with a C:A mismatch in the position +4 
(ODN21) led to an 1.4-fold increase in K M (Table 2); in 
both cases the k cat value became 1.4-fold higher. 
Introduction of a T:G mismatch in the position —6 
(ds ODN 18) decreased the affinity and the rate constant 
relative to ds ODN1 1.5- and 1.3-fold, respectively. The 
maximal 5.4-fold decrease in the affinity and 1.9-fold 
increase in the £ cat value relative to ODN1 was observed 
for ODN19 containing two T:G mismatches in positions 
+3 and -6. 

At the next step, we introduced mismatches in the im- 
mediate vicinity of the lesion where they could affect both 
the local deformability of DNA and the easiness of 
eversion of the damaged base. Interestingly, a replacement 
of two complementary C:G pairs of ODN7 in the pos- 
itions + 1 and —1 with two T:G mismatches resulted in a 
smaller 6.4-fold decrease in the affinity (ds ODN 17) than 
the 35-fold decrease in the affinity observed for ds ODN 14 
containing two C:A mismatches in the same positions 
(Table 2). This observation may reflect the tendency of 
T:G mispairs to form rather stable wobble pairs. Despite 
the higher K M values, the reaction rate was hardly 
affected: relative to ds ODN7, the k cat value for ds 
ODN17 was increased 1.3-fold, while ODN7 and 
ODN 14 demonstrated the same catalytic rate. Only in 
the case of ODN 15 an ODN 16 containing six mismatches 
flanking the lesion (from +1 to +3 and from —1 to —3), the 
coincidental and maximal decrease in the affinity (28- to 
36-fold) and in the reaction rate (2.7- to 3.3-fold) in com- 
parison with ODN7 was observed (Table 2). 

It can be seen that various modifications of the structure 
of ds ODNs can produce different effects on the affinity 
and reaction rate. The introduction of mismatches 3-6 nt 
away from oxoG can either improve or impair the cataly- 
sis by OGG1 in terms of K M and k c . At , whereas the 
mismatches flanking the lesion always decrease the 
affinity of the enzyme for such substrates but only a 
stretch of flanking mismatches can significantly decrease 
both the affinity and the rate of catalysis. Taking this into 
account, it was interesting to analyze what properties of ds 
ODN substrates in solution may be the most important 
for their effective use by the enzyme. 
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Figure 6. Linear correlations between the experimentally measured /f ca , values and either (A) the twist angle averaged over —6 to +6 position of the 
ODNs [Equation (3)] as calculated by the ACTIVITY software or (B) the k CM values predicted by Equations (1), (3) and (7). Open circle and dashed 
line, learning set: (A) each strand of ds ODN1-ODN6 (r = 0.862, oc <0.0005); (B) ds ODN1-ODN12 (r = 0.869, a <0.00025). Closed circle and 
straight line, control set: (A) each strand of ds ODN7-ODN12 (r = 0.835, a <0.001); (B) mismatched ds ODN13-ODN21 (r = 0.667, a <0.05). 



Analysis of correlation between OGG1 kinetics and 
DNA structure parameters 

The adjustment of DNA and enzyme conformation in the 
case of nonspecific and specific repair enzymes includes 
many changes of their initial structure: change of con- 
formation of sugar-phosphate backbone, partial or 
complete melting of DNA, destruction of staking inter- 
actions, formation of kinked conformation, etc. (3). 
Only in the case of cognate DNA having specific structural 
parameters there may be perfect adaptation of enzyme 
and DNA. In order to analyze possible relationships 
between properties of DNA and kinetic constants of the 
OGG1 -catalyzed reaction, we have used the ACTIVITY 
software (31) to correlate these constants with 38 con- 
formational (twist, propeller, bend toward major or 
minor groove, width of major or minor groove, etc.) or 
physicochemical (changes in enthalpy, entropy, Gibbs free 
energy, etc.) parameters of the ds ODN used in our 
experiments. 

The maximal positive decision parameter Q = 0.389 
[Equation (5)] was found for k cat in the case of Pn ; [-6;+6] 
representing the average twist (£2 angle) of the 21 bp of the 
ds ODN helix (Table 3) between the positions —6 and +6 
(the position of the oxoG residue is defined as position 0). 
The second value, Q(Pn-[s-+s]) = 0.065 indicates a lower 
but significant correlation of the twist for a longer stretch 
of the ds ODN helix between the positions —8 and +8. The 
P\i;[-6;+6] estimates calculated for each chain of all ds 
ODNs are given in Fig. 6A. The correlations between 
Pii;[-6;+6] and & cat values for each strand of ds ODN1 
ODN6 (r = 0.862, a < 0.0005) and for the independent 
control set, each strand of ds ODN7-ODN12 (r = 0.835, 
a < 0.001) were statistically significant (Figure 6B), 
producing the following linear regression after pooling 
the data for fully complementary ds ODN 1 ODN 12 
(r = 0.869, a < 0.00025): 

Acatk-lO • ■ ■ e+iol = 0.88 X CPll;[-6;+6]{e-10 . . . e+io} - 33.76. 

(7) 



To analyze correlation of k cat with the twist angle for par- 
tially mismatched duplexes, for which no experimentally 
determined structural parameters is available, we used a 
'limiting stage' approach. Briefly, the twist angle (or any 
other parameter that may be included in the analysis) is 
calculated for two fully complementary duplexes, each 
corresponding to one strand of the mismatched duplex. 
The value of /c cat is then calculated for both duplexes 
using the regression obtained above for fully complemen- 
tary duplexes, and the worst value of the two (i.e. lower 
k cat ) is kept for the correlation analysis. The & cat values 
calculated using Equations (1) and (7) for partially 
mismatched ds ODN13-ODN21 correlate statistically 
significantly (Figure 6B; r = 0.667, a < 0.05) with the 
experimental fe cat values. Overall, A& cat reflecting the 
difference between the experimental and predicted k cat 
values [Equations (1) and (7); AA' cat = k oat -k cat 
{e°_i 0 . . .G. . .e°+io/e # -io- ■ -C. . .e # +i 0 }] varied from -0.47 
to 0.92 for all 21 ds ODNs. These values showed no cor- 
relation with the number of mismatches (r = —0.021, 
oc > 0.925, data not shown), indicating that the 'limiting 
stage' approximation [Equations (1) and (7)] fully de- 
scribes the linearly additive contribution of DNA 
sequence to the reaction rate for both completely and 
partially mismatched duplexes and does not introduce a 
systematic bias in the predicted k cat values. 

For K M values, the maximal positive Q = 0.083 
[Equation (5)] was found for the negative correlation 
with the average AG° (-P38;[-io;+io]) of the analyzed DNA 
sequences (Figure 7A). The correlation was significant for 
both the learning set, each strand of ds ODN1-ODN6 
(;-= -0.832, a < 0.001), and the control set, ODN7- 
ODN12 (r = -0.806, a < 0.0025) (see 'Materials and 
Methods' section for a description of learning and 
control sets). For the pooled set of completely comple- 
mentary ds ODN1-ODN12, the following linear regres- 
sion was found (r = —0.462, a < 0.025): 

A'm{<?-io ...e+i 0 } = 42.68 x CP38;[-io ; +iO]{e-io • ■ -e+io} - 0.86. 

(8) 
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Figure 7. Linear correlations between the experimentally measured K M values and either (A) the Gibbs free energy averaged over —10 to +10 
position of the ODNs [Equation (3)] as calculated by the ACTIVITY software or (B) the K M values predicted by Equations (3), (8) and (9). Open 
circle and dashed line, learning set: (A) each strand of ds ODN1-ODN6 (r = -0.832, a <0.001); (B) ds ODN1-ODN6 and ODN13-ODN15 
(;• = 0.850. a <0.005). Closed circle and straight line, control set: (A) each strand of ds ODN7-ODN12 (r = -0.806, a <0.00025); (B) ds 
ODN7-ODN12 and ODN16-ODN21 (r = 0.862, a <0.0005). Dotted line, all data combined: (A) each strand of ds ODN1-ODN12 [r = -0.457, 
a <0.025; Equation (8)]; (B) ds ODN1-ODN21 (r = 0.806, oc< 0.00001). 



The deviation of the experimental K M values from the 
values calculated using the 'limiting stage' approximation 
(Equations (2) and (7); A.K M = K M -K M {e°_i 0 . . . 
G. . .e 0 + io/e # _i 0 . - .C. . .e # +io}) varied from -10.3 to 
38.3 nM for ODN1-ODN12 and from -27 to 373 nM 
for partially mismatched ODN13-ODN21. Unlike in the 
case of & cat , a statistically significant correlation was found 
between these values and the number of mismatches for 
the set of all 21 ds ODNs (r = 0.814, a < 0.00001, data not 
shown). Therefore, a linearly additive contribution of dis- 
turbance of strand complementarity to the K M value 
exists. Using the data for ODN1-ODN6 and ODN13- 
ODN15, we have introduced a regression correcting 
Equation (2) for the number of mismatches (N^): 

k M {e°_i 0 . . . G . . . e° +m /et w ...C... e* l0 } 
= 72.7 x N^+MAX(K M {e°_ m . . . G . . . c%,} ; (9) 
K M {e*_ m ...C...e* w }). 

The predicted K M values calculated using Equation (9) 
for all 21 ds ODNs are shown in Figure 7B. They are 
statistically significant both for the learning set combining 
ODN1-ODN6 and ODN13-ODN15 (r = 0.850, 
a < 0.005) and for the control set combining ODN7- 
ODN12 and ODN16-ODN21 (r = 0.862, a < 0.0005). 

As an independent test of validity of the 'limiting stage' 
approximation, we have applied it to predict specific AG° 
for one step of DNA helix, 8G°: 

8G°{e°_ l0 ...G...e° +lQ /el l0 ...C...e* l0 } 
= MAX(P 3 8;[-io;+io]{e% . . . G . . . e° +lQ }; (10) 

P38;[-10:+10]{e # io---C...4 10 }). 

The value of 5G° for all ds ODNs calculated using 
Equation 10 (Table 4) statistically significantly correlated 
(r = 0.813, oc< 0.00001) with AG° but not with enthalpy 
(AH°; r = 0.027, a>0.9) or entropy (AS 0 ; r = 0.270, 



oc > 0.2). Earlier, we have found correlation of melting 
temperature of fully dsDNA with the frequency of muta- 
tions induced by 2-aminopurine (44). Here we have 
calculated the melting point of ds ODNs and have 
found its correlation with 8G° (Table 4; r = -0.880, 
oe<10~ 6 ). These data indicate that the use of the 
'limiting stage' approximation [Equations (1), (2) and 

(9) ] is appropriate. 

Finally, Table 4 gives a comparison of predicted §G° 

(10) with the experimental A' cat values and their deviations 
(A/c cat ) from the values predicted from DNA twist 
[Equations (1) and (7)]. One can see that §G° correlates 
with k cat (r = —0.517, a < 0.025) but not with A/c cat 
(r = —0.282, a > 0.2). Therefore, the linearly additive con- 
tribution of 8G° to the k^t value is limited to the internal 
energy of DNA helix twisting, which is a part of §G°. 
Thus, we have found the equations that use DNA 
sequence and helix structural properties to predict the 
k cat and K M values characterizing interaction of OGG1 
with ds cognate 21-mer ODNs. 

DISCUSSION 

The present study shows that the recognition of 
non-cognate DNA by OGG1 is similar to its recognition 
by DNA polymerases (7,8,45), UNG and Fpg DNA 
glycosylases (10-12), APEX1 (13), topoisomerase I 
(15,16), EcoRl (14), HIV integrase (17), RecA protein 
(6), DNA ligase I and RNA helicase p68 (DDX5) (3) in 
terms of additivity of Gibbs free energies. Usually, the 
footprint of 30-40 kDa enzymes, including OGG1, 
covers one helix turn or 10 base pairs of DNA, and their 
active sites bind one singled out cognate or non-cognate 
nucleotide unit with a relatively high affinity. In contrast 
to many enzymes but similarly to UNG and EcoRl 
(10,14), OGG1 forms efficient contacts only with the 
internucleotide phosphate groups of non-cognate ds 
DNA. The / factor (1.78) for OGG1, reflecting its 
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Table 4. Comparison of specific 8G° (AG° for one step of DNA helix) predicted from the K M and the energy parameters of ds ODNs 
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interaction with one internucleotide phosphate (K d 
characterizing the enzyme interaction with one nucleotide 
unit is \jf= 1/1.78 = 0.56 M), is comparable with /factors 
for other enzymes: 1.35 for human UNG; 1.45 for human 
APEX1; 1.52 for Klenow fragment of DNA polymerase I 
from E. coli; 1.54 for E.coli Fpg; 1.61 for human RNA 
helicase p68; 1.3-2.0 for HIV-1 integrase; 2.0 for EcoRl, 
and 2.14 for human DNA ligase I. Similarly to all these 
enzymes, the second strand contributes to the affinity of 
OGG1 for ds DNA much less than the first strand. 

The affinity of OGG1 for ds OG11 (11 nM) is only 
1.8-fold lower than that found for Fpg (6nM) (11). 
Interestingly, the high and comparable affinity of these 
enzymes for ds OG11 is achieved by different means. 
Non-specific interactions with ds Gil contribute to the 
total affinity of Fpg for cognate ds DNA (K^ = 67 nM) 
(11) 130-fold more than do the non-specific interactions 
of OGG1 (8.7 uM, Table 1), while the efficiency of 
non-specific interactions of Fpg with internucleotide phos- 
phate groups of one strand of ss Gl 1 (AT; = 10 uM) is only 
2-fold better that that for OGG1 (K { = 20 uM) (Table 1). 
Thus, the second strand (complementary to Gil) contrib- 
utes to the increased efficiency of non-specific interaction 
of Fpg with ds OG11 to a much greater extent than in 
the case of OGG1. In addition, Fpg does not rely on co- 
operative interactions with the substrate to any extent. 
The contribution of specific interactions of a oxo-d(pG) 
unit and non-specific contacts with other units ds OG1 1 to 
the total affinity are close to additive; the affinity of Fpg 



protein for an oxo-d(pG) unit (~0.1M) is comparable 
for oxo-dGMP, ss OG11 and ds OG11 (11). Like Fpg, 
other enzymes studied by SILC usually do not show 
more than 1-2 orders of the increase in affinity for 
cognate ds ODNs compared with non-cognate ones; 
there is only a 2- to 5-fold difference for APEX1, 7- 
to 10-fold for UNG, 50- to 70-fold for HIV integrase, 
50- to 100-fold for £coRI, and 200- to 250-fold for topo- 
isomerase I (10,13-17). 

In contrast to Fpg (11), the transition from ss to ds 
non-cognate Gil led to only a 2.3- to 3.3-fold increase 
in the affinity. At the same time, cognate ss OG11 
is bound 250-fold stronger than non-cognate ss Gil 
(Table 1). Thus, the formation of specific contacts 
between the oxoG unit and the active site of OGG1 has 
a cooperative effect on the DNA-binding groove of the 
enzyme. Some non-specific, previously weak interactions 
with non-cognate ODNs may greatly strengthen when 
cognate DNA is bound; for example, in the complex of 
DNA with topoisomerase I they attain the character 
of specific interactions (15,16). A very pronounced 
strengthening of non-specific electrostatic interactions of 
HIV integrase is observed even for non-cognate ss d(pC)„, 
which is more flexible structure than ss d(pT)„ (17). The 
highest, 790-fold increase in the affinity was observed for 
ds OG1 1 as compared with ds Gl 1 (Table 1). Therefore, in 
contrast to Fpg, the specific structure of the oxoG unit is 
very important for the productive cooperative changes 
during mutual adjustment of conformations of OGG1 
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and cognate DNA. In conclusion, OGG1 is the first 
example of a DNA-dependent enzyme for which the con- 
tribution of the specific interactions with DNA is close to 
three orders of magnitude. 

All DNA-dependent enzymes studied by the SILC 
approach are highly specific, converting their cognate 
DNA 4—8 orders of magnitude more efficiently than 
non-cognate DNA [reviewed in refs (3,5)]. Formation of 
the enzyme-substrate complex cannot alone explain this 
specificity. For instance, all these enzymes, including 
OGG1, interact with non-cognate RNA-RNA and 
RNA-DNA duplexes with affinities comparable to those 
for DNA-DNA duplexes with the affinity that is only 1-2 
orders of magnitude lower than their affinity for cognate 
DNA-DNA duplexes. Yet these enzymes do not process 
RNA-RNA and RNA-DNA duplexes even at saturating 
concentrations of these ligands. Many data indicate that 
all enzymes acting on long DNA first bind to DNA of any 
sequence in and then slide to the site containing a specific 
sequence or a lesion [reviewed in refs (3,5,46)]. There they 
stop due to increase in the affinity for DNA (estimated 
5- to 790-fold for different enzymes) and can change the 
conformation of the DNA sugar-phosphate backbone in 
multiple ways. Formation of hydrogen bonds between the 
enzymes and the bases of their cognate sites identified by 
X-ray analysis is most probably one of the final stages in 
the selection of the cognate DNA substrates. However, 
only the formation of these bonds can promote the 
proper fit of specific bases into recognition pockets of 
the enzymes, accelerating the reactions by 4-8 orders of 
magnitude. Thus, it is not the stage of complex formation, 
but rather the second (DNA adjustment to the optimal 
conformation) and third stages (the irreversible catalytic 
step), which provide most of the specificity for the 
enzymes involved in DNA replication, DNA repair and 
other processes of DNA metabolism. 

The affinity and rate constant for enzymes recognizing 
specific features in DNA can be influenced by the con- 
formational flexibility, and dynamic behavior of the 
enzyme and DNA, and their capability for specific 
mutual conformational adjustment. Therefore, it was 
interesting to find out which conformational and struc- 
tural properties of cognate DNA can affect the K M and 
A cat values of OGG1. Using the ACTIVITY software, we 
have shown that the fc cat values are correlated with the 
twist of DNA between the positions —6 and +6. Lower 
but still significant correlation existed with the twist over a 
longer [— 8;+8] stretch, consistent with the importance of 
the twist parameter for the /c cat , but the lower Q value in 
this case may indicate that the twist is less relevant outside 
of the enzyme's footprint. OGG1 kinks and partially 
unwinds DNA to evert the oxoG deoxynucleoside into 
the active site (25), and the increased twist may assist 
the enzyme in these acts. Another possibility is that 
some optimal twist angle is required for the most favor- 
able positioning of DNA phosphates for interactions with 
the DNA-binding groove of OGG1. Some other DNA- 
binding proteins are also sensitive to the twist angle of 
B-DNA. For example, higher twist is associated with 
increased efficiency of transcription initiation from 
E. coli promoters by RNA polymerase (47,48) and with 



preferred places of nucleosome assembly (49). 
Characteristic twist angle patterns are observed for 
binding sites of Spl and p53 transcriptional regulators 
(50) and for eukaryotic ribosomal promoters (51). 

For K M values, a negative correlation was observed 
with the average AG° characterizing the stretch of DNA 
between the positions —10 and +10. The most likely ex- 
planation is that the lower values of AG° are associated 
with a more stable DNA duplex, more resistant to the 
disruption of base pairing effected by OGG1. This 
enzyme is known to search for oxoG lesions flipping out 
normal bases from DNA (27), a process that overall 
would be more efficient in less stable duplexes. 
Additionally, the 'limiting stage' approximation was 
applied for prediction of specific AG° for one step of 
DNA helix, §G°. The calculated 8G° for all ds ODNs 
correlated with the changes in AG° but neither with 
enthalpy nor entropy (Table 4). Therefore, using correl- 
ation of the kinetic parameters with conformational and 
physicochemical DNA properties, we have found empir- 
ical dependencies for estimating the K M and k cat values for 
various ds ODNs containing an oxoG nucleotide on the 
basis of theoretically calculated twist, AG°, or 8G°. 
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